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(57) Abstract: A method of designing, modelling 
or fabricating a communications baseband stack, 
comprising the steps of: (a) creating a description 
of one or more of the following parameters of 
the baseband stack: (i) resource requirements; 
(ii) capabilities; (iii) behaviour, and (b) using that 
description as an input to software comprising a 
virtual machine layer optimised for a communications 
DSP in order to generate an emulation of the baseband 
stack to be designed, modelled or fabricated. 
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METHOD OF DESIGNING, MODELLING OR FABRICATING A 
COMMUNICATIONS BASEBAND STACK 

FIELD OF THE INVENTION 

This invention relates to software for designing, modelling or fabricating a communications 
baseband stack. Communications baseband stacks are used for digital signal processing in 
communications equipment 

DESCRIPTION OF THE PRIOR ART 

Technology Background: digital signal processing, DSPs and baseband stacks. 
Digital signal processing is a process of manipulating digital representations of analogue 
and/ or digital quantities in order to transmit or recover intelligent information which has 
been propagated over a channel. Digital signal processors perform digital signal processing 
by applying high speed, high numerical accuracy computations and are generally formed as 
integrated circuits optimised for high speed, real-time data manipulation. Digital signal 
processors are used in many data acquisition, processing and control environments, such as 
audio, communications, and video. Digital signal processors can be implemented in other 
ways, in addition to integrated circuits; for example, they can be implemented by micro- 
processors and progranmmed computers. The term T)SP' used in this specification covers 
any device or system, whether in software or hardware, or a combination of the two, capable 
of performing digital signal processing. The term TDSP' therefore covers one or more digital 
signal processor chips; it also covers the following: one or more digital signal processor chips 
working together with one or more external co-processors, such as a FPGA (field 
progra mm able gate array) or an ASIC programmed to perform digital signal processing; as 
well as any Turing equivalent to any of the above. 
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In the communications sector, a DSP will be a critical element for a baseband stack as the 
baseband stack runs on the DSP; the stack plus DSP together perform digital signal 
processing. The term ^baseband stack' used in this specification means a set of processing 
steps (or the structures which perform the steps) including one or more of die following: 
5 source coding, channel coding, modulation, or their inverses, namely source decoding, 
channel decoding and demodulation. In addition, the term 'baseband stack' should be 
construed as including structures capable of processing digital signals without any form of 
down conversion; a software radio would include such a baseband stack. As will be 
appreciated by the skilled implementer, source coding is used to compress a signal (ie. the 
10 source sign2^ to reduce the bitrate, Channel coding adds stmctured redundancy to improve 
the ability of a decoder to extract information from the received signal, which may be 
corrupted. Modulation alters an analogue waveform in dependence on the infomiation to be 
propagated. 

15 Baseband stacks are found in mobile telephones (e.g. a GSM stack or a UMTS stack) and 
digital radio receivers (e.g. a DAB stack), as well as other one and two-way digital 
communications devices. The term ^communications' used in this specification covers all 
forms of one or two way, one to one and one to many communications and broadcasting. 
The terms designing* and 'modelling* typically includes the processes of one or more of 

20 emulation, resource calculation, diagnostic analysis, hardware sizing, debugging and 
performance estimating. 

The increasing complexity of communications systems places intense pressure on 
baseband stack development 

25 The complexity of communications systems is increasing on an almost daily basis. There are 
a number of drivers for this: traffic on the Internet is increasing at 1000% pa. Much of this 
(largely bursty) data is moving to wireless carriers, but there is less and less spectrum 
available on which to host such services. These facts have led to the use of ever more 
complex signal processing algorithms, in order to squeeze as much data as possible into the 
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smallest possible bandwidth. In fact, the complexity of these algorithms has been increasing 
faster than Mooters law ^e that computing power doubles every 18 months), with the result 
that conventional DSPs are becoming insufficient. For complex terminals, therefore, an 
ASIC must be produced to manage the vast parallel processing load involved. However, this 
5 is where the problems really begia. For not only are the algorithms used more complex on 
the signal processing front; the use of bursty, variable-QoS, often ephemeral transport 
channels, mandated by the move from primarily voice traffic to primarily Internet-related 
traffic, needs ever more sophisticated control plane software, even at Layer 1 (which requires 
hard real-time code). Conventional DSP toolsets do not provide an appropriate mechanism 
10 to address this problem, and as a result many current designs are not scalable to deal with 
'real worid' data applications. 

However, the high MIPs requirements of modern communication systems represent only 
part of the story. The other problem arises when a multiplicity of standards (e.g., GSM, IS- 
IS 136, UMTS, IS-95 etc.) need to be deployed witiun a single SoC (System on a Chip). SoC 
devices supporting multiple standards will be increasingly attractive to device vendors 
seeking to tap efficientiy different markets in different covmtries; also, it is expected that the 
next generation UMTS phones will have not only GSM (or current generation) capabilities 
but also added features, such as DAB (Digital Radio Broadcasting) receivers, hence requiring 
20 baseband stacks for UMTS, GSM and DAB. The complexity of communications protocols 
is now such that no single company can hope to provide solutions for all of them. But there 
is an acute problem building an SoC which integrates IP firom multiple vendors (e.g. the IP 
in the three different baseband stacks listed above) together into a single coherent package in 
iacreasingly short timescales: no commercial system currentiy exists in the market to enable 
25 multiple vendors' IP to be interworked Layer 2 and layer 3 software (generally, soft real-time 
code) is more straightforward, since it may simply be run as one process of many as software 
on a DSP or other generalised processor. But layer 1 IP (hard real time, often parallel) 
algorithms, preseiat a much more difficult problem, siace the necessary hardware acceleration 
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often dominates the architectute of the whole layer, providing non-portable, fragile, 
solution-specific IP. 

Overview of deficiencies in current models of baseband stack development 
In the past, baseband stacks have been relatively simple, the amount of required high-MIPs 
functionality has been relatively small and only modest amounts of multi-standard, multi- 
vendor integration have been perfomied. But as noted above, none of these now apply, (a) 
the bandwidth pressure means that ever more complex algorithms (e.g,, turbo decoding, 
MUD, RAKE, etc.) are employed, necessitating die use of hardware; (b) the increase in 
packet data traffic is also driving up the complexity of layer 1 control planes as more birth- 
death events and reconfigurations must be dealt with in hard real time; and (c) time to 
market, standard diversification and differentiation pressures are leading vendors to integrate 
more and more increasingly complex functionality (3G, Bluetooth, 802.11, etc.) into a single 
device in record time - necessitating die licensing of layer 1 IP to produce an SoC (system 
on chip) for a particular target application. 

Currentiy, tiiere is no adequate solution for tiiis problem; the VHDL toolset providers (such 
as Cadence and Synopsis) are approaching it from die Tjottom up' - their tools are effecthre 
for producing individual high-MIPs units of fimctionality (e.g., a Viterbi accelerator) but do 
not provide tools or integration for the layer 1 framework or control code. DSP vendors 
(e.g., n. Analog Devices) do provide software development took, but their real time models 
are static (and so do not cope well with packet data burstiness) and their DSPs are limited by 
Moore's law, which acts as a brake to thetc usefulness. Furdiermore, communication stack 
software is best modelled as a state machine, for which C or C++ (the languages usually 
supported by the DSP vendors) is a poor substrate. 

Detailed analysis of deficiencies in current models of baseband stack development 
Conventionally, baseband stack development for digital communications is fragmented and 
highly speciaUsed. For example, the initial development of the signal processing algorithms 
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that are the heart of a baseband stack is generally performed on a mathematical modelling 
environment (such as Matlab), with fitting to a particular memory and MEPs (Million 
Instructions per Second) budget for the final target DSP being done by skilled estimation 
using a conventional spreadsheet. Once this modelling process has been performed 
5 satisfactorily, code modules and infrastructure software for the stack will be written, adapting 
existing libraries where possible (and possibly an RTOS (Real-Time Operating System)). 
Then, a 'real time' prototype hardware system will be bmlt (somcthnes called a 'rack) in 
which any required hardware acceleration will be prototyped on PLDs (Programmable Logic 
Device) where possible. This will be tested off air, and necessary changes made to the code. 
10 Once satisfactory, the stack will be 'locked off and the final ASIC (Application Specific 
Integrated Circuit) (incorporating the hardware acceleration modules as on-chip peripherals) 
will be produced. The resultant baseband DSP or DSP components is then tested and then 
shipped. 

15 There are a number of problems with this 'traditional' approach. The more important of 
these are that: 

• The resulting stacks tend to have a lot of architecture specificity in their construction, 
making die process of 'porting' to another hardware platform (e.g. a DSP firom another 
manufacturer) time consuming. 

20 • The stacks also tend to be hard to modify and 'firagile', making it difficult both to 
implement in-house changes (e.g., to rectify bugs or accommodate new features 
introduced into the standard) and to licence the stacks effectively to others who may 
wish to change them slightiy. 

• Integration witii the MMI (Man Machine Interface) tends to be poor, generally meaning 
25 that a separate microcontroller is used for this fimction within the target device. This 

increases chip count and cost 

• The process is quite slow, with about 1 year minimum elapsed time to produce a 
baseband processor for a significantiy complex system, such as DAB (Digital Audio 
Broadcasting). 
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• The process puts a lot of stress on technical authorities - so called 'gurus* - to govern 
the overall best way to allocate buffers, manage downconversion, insert digital filters, 
generate good channel models and so on. This is generally a disadvantage since it adds a 
critical path and key personnel dependency to the project of stack production and 

5 lengthens timelines. The resulting product is quite likely not to include all the appropriate 

current technology because no individual is completely expert across all of the prevailing 
best practice, nor will the gurus or their team necessarily have time to incorporate all of 
the possible innovations in a given stack project even if they did know them, 

• The reliance on manual computation of MIPs and memory requirements, and the 
10 bespoke nature of the DSP modules and infrastructure code for the stack, means that 

there is an increased probability of error in the product 

• An associated point is that generally real-time prototyping of the stack is not possible 
until the 'rack' is built; a lack of high- visibility debuggers available even at that point 
means that final stack and resource 'lock off is delayed unnecessarily, pushing out the 

15 hardware production time scale. High visibility debuggers would, if available, be very 

useful since they provide, when developing in a high level language like C++, the ability 
in the development tool to place break points in the code, halt the processing at that 
point and then examine the contents of memory, sin^e step instmctions to see their 
effects, etc. Triggers can then also be placed in die code that will stop execution and start 

20 up the debugger when particular conditions arise. These are very powerful tools when 
developing application software. TLdck-off refers to the fact that when one phase of the 
project is complete, development can move onto the next In a hardware development 
you cannot iterate as easily as in software as each iteration requires expensive or time 
consuming fabrication. 

25 • Because it is likely that low-level modules or hardware acceleration 'controllers' will have 
to be developed for the stack being produced, developers will have to become familiar 
with the assembly language of the target processor, and will become dependent upon the 
development tools provided for that processor. 



wo 01/53999 



PCT/GBOl/00278 



7 

• Lack of modularity coupled with the fact that the infrastructure code is not reused means 
that much the same work will have to be redone for the next digital broadcast stack to be 
produced. 

Coupled with these difficulties are an associated set of 'strategic' problems that arise from 
this type of approach to stack development, in which stacks are inevitably strongly attached 
to a particular hardware environment, namely: 

• From the stack producer's point of view, there is an uncomfortably close relationship 
with the chosen DSP hardware platform. Not only must this be selected carefully since 
mistakes will require a costly (and time-consuming) port, but the development tools, low- 
level assembly language, test 'rack' hardware development and final platform ASIC 
production will all be architecture-specific. If an opportunity to use the stack on another 
hardware platform comes up, it will first have to be ported, which will take quite a long 
time and introduce multiple codebases (and thereby the strong risk of platform-specific 
bugs). The code base is the source code that underpins a project. Ideally when 
developing software you would have a one to one mapping between source code and 
functionality, so if a nvimber of projects require a particular function they would all share 
the same implementation. Thus, if that implementation is improved all projects will 
benefit What tends to happen, however, is that separate projects have separate copies of 
the code and over time the implementations diverge (rather like genes in the natural 
worlc^. When projects use different hardware, under the conventional development 
paradigm, it is sometimes impossible to use the same code. And even if the same 
hardware platform becomes available with an upgraded specification, the code will still 
have to undergo a 'mini-port' to be able to use those additional features (more on-board 
memory, for example, or a second MAC (Multiply Accumulate) unit). 

• From the hardware producer's point of view, there is an equally uncomfortably close 
relationship with the software stacks. Hardware producers do not want (on the whole) to 
become experts in the business of stack production, and yet without such stacks (to turn 
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their devices into useful products) they find themselves unable to shift units. For the 
marketplace, the available 'software base' can obscure the other features upon which the 
hardware producer's products ought more properly to compete (such as available MIPs, 
power consumption, available hardware IP, etc.). 
5 • Operating system providers (such as Symbian Limitec^ find it essential to interface their 
OS with baseband communications stacks; in practice this can be very difficult to 
achieve because of the monolithic, power hungry and real-time requirements of 
conventional stacks. 

10 Reference may be made to eXpressDSP Real-Time Software Technology from Texas 
Instruments Incorporated. This suite of products enables the reduction of development and 
integration time for DSP software. But it exemplifies many of the disadvantages of 
conventional design approaches since it is tied exclusively to tiie Texas Instruments DSP 
platform. Further detailed differences of one implementation of the present invention over 

15 the eXpressDSP Real-Time Software Technology suite are summarised in the Detailed 
Description. 

SUMMARY OF THE PRESENT INVENTION 

20 

In accordance with a first aspect of the present invention, there is provided a method of 
designing, modelling or fabricating a conamunications baseband stack, comprising the steps 
o£ 

(a) creating a description of one or more of the following parameters of the 
25 baseband stack 

(i) resource requirements; 
QI) capabilities; 
(ii^ behaviour; and 
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(b) using that description as an input to software comprising a virtual machine 
layer optimised for a communications DSP in order to generate an emulation of the 
baseband stack to be designed, modelled or fabricated. 

Hence, the present invention contemplates (i) applying a form of 'emulation' to the domain 
of communications baseband stack design and (ii) introduces the idea of using a virtual 
machine layer optimised for a communications DSP in this context This approach makes 
accurate simxilation of resource utilisation (e.g. processor requirements, peak resource 
situations, state considerations etc.) possible. The term 'emulation' used in this specification 
should be broadly construed in this context to include any process which enables a system 
(whether hardware or software) to behave in the same or a similar way to another system 
(whether hardware or software). Modifications and refinements can be made at an early 
design stage with the present invention, improving design quality, reducing the chance of 
costiy design errors and reducing overall time to market. 

Preferably, the method includes the following stages: 

(a) using, for one or more components to be incorporated in the baseband stack, 
a component description which defines some or all of the externally visible attributes 
of a component, as well as its behaviour, as an input to a mathematical modelling 
tool programmed to output component related performance data for each 
component; 

(b) processing the component related performance data for each component to 
yield a baseband stack description; 

(c) creating a resources description defining the resources of the baseband stack; 

(d) creating an interface description defining how each component is to used in 
the baseband stack; and 

(e) using each of the baseband stack description, the resources description, and 
the interface description as the inputs to the software. 
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The software can therefore emulate accurately the baseband stack; it can also be both 
instrumented and interpreted/ compiled to output diagnostic information in respect of a 
component in the same format as (e.g. in order to merge with) die component description 
for that component in order to refine the quality of the component description. This 
feedback loop can be very effective in rapidly extracting accurate data and feeding it back 
into the design loop. 

Another advantage of this structured approach is that hardware components can be 
progressively introduced into a test system: a first test may be carried out using software to 
emulate a given hardware component as part of a design or modelling process; the emulated 
component is then replaced with die hardware component, and a fiirther test is carried out. 
Problems and unexpected consequences of using hardware components can therefore be 
more readily identified. In the same way, ports of individual stack modules can be made to a 
particular architecture and tested: for example, imagine a baseband stack comprising 
modules A, B and C: once fully tested in a software emulation, module A can be ported onto 
the target DSP and the system re-tested, with module A running on the target DSP and 
modules B and C continuing to run on the emulator. Problems can therefore be more 
readily identified and resolved. 

In addition to emulating the baseband stack, the method can be used to fabricate an actual 
baseband stack implementation (i.e. generate executable code running on the target 
platform) by compiling automatically generated source code. 

The method of the present invention may utilise a standardised description of the 
characteristics (including non-interface behaviour) of communications components to enable 
the emulation to accurately estimate the resource requirements of a system using those 
components. This is referred to as the Component Definition Language - ('CDU) in the 
embodiment described in the Detailed Description. Communications components are 
conventionally described with a variety of ad hoc labels. This renders any systematic 
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approach to simulation impossible. Using the standardised description system, component 
developers will be able to publish their component specifications for potential developers to 
make use of. Product developers will be able to benchmark their solution using a number of 
potential suppliers simply by plugging in different data files. It will also be possible for a 
5 system builder to calculate, through repeated simulation or mathematically, the ideal 
specifications of the components they want Once they have completed this process they will 
be able to approach potential suppliers armed with precise details of what they require. 

Further, the method of the present invention may also utilise a language designed to define 
10 comphUhf the functionality of a baseband stack (e.g. receiver/transceiver) to estimate, simulate 
or fabricate a real device using the above design process. This is referred to as the Device 
Definition Language - (T>DU) in the embodiment described in the Detailed Description. This 
leads to many advantages: Currentiy, defining the fimctionality of a receiver/transceiver is 
often done in a non-systematic ad hoc manner. DDL however allows the exchange of 
15 information between any number of diverse applications, design tools and visualisers. It will 
also be architecture independent and provide a reliable medium of exchanges between 
individuals, companies etc. The language wiU be extensible to allow it to incorporate 
innovations io the fiiture and so tiiat third parties can incorporate their own components. 

20 At this point, some further elaboration on the meaning of a Virtual machine layer* is 
appropriate. A Virtual machine' typically defines the functionality and interfaces of the ideal 
machine for implementing the type of applications relevant to the present invention. It 
typically presents to the using application an ideal machine, optimised for the task in hand, 
and hides the irregularities and deficiencies of the actual hardware. The Virtual machine' 

25 may also manage and/ or maintain one or more state machines modelling or representing 
communications processes. The Virtual machine layer* is then software that makes a real 
machine look like this ideal one. This layer will typically be different for every real machine 
type, A Virtual machine layer^ typically refers to a layer of software which provides a set of 
one or more APIs (Application Program Interfaces) to perform some task or set of tasks 
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(e.g. digital signal processing) and which also owns the critical resources that must be 
allocated and shared between using programs (e.g. resources such as memory and CPU). 

The virtual machine layer in an implementation of the present invention is preferably 
5 optimised to allocate, share and switch resources in such a way as is best for digital signal 
processing; a typical operating system, in contrast, will be optimised for general user- 
interface programs, such as word processors. Thus, for example, the resource switching 
algorithms in this case will typically operate on much smaller time increments than that of an 
end~user operating system and may control parallel processes. 

10 

The virtual machine layer, optimised for a communications DSP, insulates software 
baseband stacks from the hardware upon which they must execute. Hence, baseband stacks 
can be made very portable since they can be isolated by the virtual machine layer from 
changes in the underlying hardware. The virtual machine layer may also manage flow 
15 control between different connected modules (each performing different functions); this 
may be done on a concurrent basis. It may also define common data structures for signal 
processing, as will be described in more detail subsequently. 

The software of the present invention may be used in a development environment to enable 
20 a communications device, (e.g. a baseband stack, or indeed an entire SoC including several 
baseband stacks from different vendors, or an end product such as a mobile telephone) to be 
modelled and developed or to actually perform baseband processing. 

The potency of applying the Virtual machine layer' concept to the domain of 
25 communications DSPs can best be understood through an example from a non-analogous 
field. In the field of PC software, Microsoft's Windows™ operating system (sitting on top 
of the system BIOS) insvdates software developers from the actual machine in use, and from 
the specifics of the devices connected to it. It provides, in other words, a 'virtual machine 
layer* upon which code can operate. This is schematically illustrated in Figure 1 Because of 
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this virtual machine layer, it is not necessary for someone writing a word processor, for 
example, to know whether it is a Dell or a Compaq machine that will execute their code, or 
what sort of printer the user has connected (if any). Furthermore, the operating system 
provides a set of common components, functions and services (such as file dialog panels, 
5 memory allocation mechanisms, and thread management APIs). Because only written once, 
the rigour, extent and reliability of such 'common code' is greatly increased over what would 
be the case if each application had to re-implement it, over and over again. Further, the 
manufacturers of PC hardware are protected from the complexities of software 
development, having only to provide a BIOS and drivers from the appropriate Windows 
10 APIs in order to take advantage of the vast array of existing software for that platform. This 
situation can be contrasted with the pre-Windows situation in which each application would 
frequendy contain its own custom GUI code and drivers, as illustrated in Figure 2. 

A key enabler for the PC Windows 'virtual machine layer" approach is that a large number of 
15 applications require largely the same underlying 'virtual machine' functionality. If only one 
application ever needed to use a printer, or only one needed multithreading, then it would 
not be effective for these services to be part of the Windows 'virtual machine layer*. But, this 
is not the case as there are a large number of applications with similar I/O requirements 
(windows, icons, mice, pointers, printers, disk store, etc.) and similar 'common code' 
20 requirements, making the PC 'virtual machine layer* a compelling proposition. 

However, prior to the present invention, no-one had considered applying the Virtual 
machine' concept to the field of communications DSPs; by doing so, the present invention 
enables software to be written for the virtual machine rather than a specific DSP, de- 
25 coupling engineers from the architecture constraints of DSPs firom any one source of 
manufacture. This form of DSP independence is as potentially usefiil as the hardware 
independence in the PC world delivered by the Microsoft Windows operating system. It is 
illustrated schematically in Figures 3 and 4. Figure 3 shows a conventional situation in 
which parts of the baseband stack which should, when properly implemented, be 
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architecturally neutral are in &ct not properly isolated from the substrate harchrare; Figure 4 
depicts how the virtual machine layer (called the Communications Virtual Machine or CVM) 
of the present invention does successfully isolate these parts of the baseband stack. 

5 There are therefore several key advantages to the CVM: 

• Porting baseband stacks across DSP architectures and to different media access 
hardware (such as, for example, porting a stack for a GSM phone operating at 900 MHz 
to one operating at I8OOMH2) will be much faster since the invention enables stacks to 
10 be designed which are not architecture or spectrum specific: a critical advantage as time 
to market becomes ever more important. Hence, a stack will work on any DSP 
architecture to which the virtual machine layer has been ported. Likewise, a DSP to 
which the virtual machine layer has been ported wiU run all the stacks written for the 
virtual machine layer. 

15 ♦ Much of the high MIPS, complex code (e.g. a Viterbi decoder) will be written once only 
for the virtual machine layer, as opposed to many different times for each DSP 
architecture. Hence, quality and reliability of this complex code can be economically 
improved. That in turn means that the baseband stacks will themselves need less code 
and what stack code there is need be less complex, thus increasing its reliability. 

20 • The virtual machine layer provides the ability to prototype either entirely in software or 
with a mixture of software and proven DSP components, allowing the identification of 
algorithmic deficiencies and resource requirements earlier in the development cycle. 

Preferably, the virtual machine layer is programmed with or enables access to various core 
25 processes and/ or core structures and/or core functions and/or flow control and/or state 
management. The core processes with which the virtual machine layer is programmed (or 
enables access to) include one or more 'common engines*. These 'common engines* 
perform one or more of the baseband stack functions, namely: source coding, channel 
coding, modulation and their inverses (source decoding, channel decoding and 
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demodulation). The 'common engines' include die fast Fourier transform (FFT), Viterbi 
decoder (with various constraint lengths, Galois polynomials and puncturing vectors), Reed- 
Solomon engines, discrete cosine transform (DCT) for the MPEG decoders, tune and 
frequency bitwise re-ordering for error decoherence, complex vector multiplication and 
5 Euler synthesis. A more extensive list is contained at Appendix 1. One or more of these 
parameterised transforms are commonly required by communications baseband stacks. This 
subsidiary feature is predicated on the inventive insight that a set of common processes is 
found within almost all of the key digital broadcast systems; an example is the similarity of 
GSM to DAB: both, for example, use interleaving and Viterbi decoding. Commonality is 
10 hence predicated on a common mathematical foundation. 

In addition, a *core structure' may also be present in each case. The 'core stmcture* involves 
splitting the decoding chain up into a symbol processing section (concerned with processing 
full symbols, regardless of whether all the information held within that symbol is to be used) 
15 and data directed processing, in which only those bits which hold relevant information are 
processed. In each case, it is highly desirable that the processing modules are able to allocate, 
share and dispose of intermediate, aligned memory buffers, pass events between themselves, 
and exist within a framework that enables modular development 

20 The core function may relate to resource allocation and scheduling, include one or more of 
the following: memory allocation, real time resource allocation and concurrency management 

The software can preferably access PC debug tools, which are far superior in performance 
and capability than DSP design tools. It may be subject to conformance scripting, as will be 
25 defined subsequentiy. In addition, it may operate with a component, in which only that 
information necessary to enable it to operate with and/or otherwise model the performance 
of the component is supplied by the owner of the intellectual property in the component 
This enables the owner of the intellectual property (which can be valuable trade secret 
information such as internal details, design and operation) to hide that information, releasing 
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only fiar less critical information, such as the functions supported, tiie parameters required 
the APIs, timing and resource interactions, and the expected performance for 
characterisation estimation 

5 Since the CVM draws together the ideas introduced above, and is a critical aspea of an 
implementation of the present invention, it is summarised in the following section. 

Summary of the CVM implementation 

The CVM is botia a platform for developing digital signal processing products and also a 
10 runtime for actually running those products. The CVM in essence brings the complexity 
management techniques associated witii a virtual machine layer to real-time digital signal 
processing by 0 placing high MIPS digital signal processing computations (which may be 
implemented in an architecture specific manner) into 'engines' on one side of the virtual 
machine layer and (ii) placing architecture neutral, low MIPS code (e.g. the Layer 1 code 
15 defining various low MIPS processes) on the other side. More specifically, the CVM 
separates all high complexity, but low-MIPs control plane and data 'operations and 
parameters' flow fimctionality from the high-MIPs 'engines' performing resource-intensive 
(e.g., Viterbi decoding, FFT, correlations, etc.). This separation enables complex 
communications baseband stacks to be built in an 'architecture neutral', highly portable 
20 manner since baseband stacks can be designed to run on the CVM, rather than the 
imderiying hardware. The CVM presents a uniform set of APIs to the high complexity, low 
MIPS control codes of these stacks, allowing high MIPS engines to be re-used for many 
different kinds of stacks (e.g. a Viterbi decoding engine can be used for both a GSM and a 
UMTS stock). 

25 

The CVM can form part of a design tool which can support stochastic simulation of load on 
multiple parallel datapaths (distribution to underiying 'engines' of the virtual machine) where 
the effect of the distribution of these datapaths to different positions within a potentially 
heterogenous conununications DSP topology or a non-symmetric memory topology (e.g.. 
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some components being local, others accessible across a contested bus, etc) may be explored 
with respect to expected loading patterns for given precomputed scenarios of use. The 
output of such a design tool is an initial partitioning of the design 'engines' (high-MIPs 
components) into variously distributed *hard' and 'soft' datapaths (where a hard datapath is a 
flow implemented in an ASIC or FPGA, and soft datapath is a flow implemented over a 
conventional programmable DSP). This partitioning is visible to the dynamic scheduling 
engine (by means of which the high level, architecture neutral software dispatches its 
processing requests to the underlying engines) and is utilised by it, to assist in the process of 
maldng optimal or close to optimal runtime scheduling decisions. 

Duriag the development stage of a digital signal processing product, the MIPS requirements 
of various designs of the digital signal processing product can be simulated or modelled by 
the CVM in order to identify die arrangement which gives the optimal access cost (e.g. will 
perform with the minimum number of processors); a resource allocation process is used 
which uses at least one stochastic, statistical distribution function, as opposed to a 
deterministic function. Simulations of various DSP chip and FPGA implementations are 
possible; placing high MIPS operations into FPGAs is highly desirable because of their speed 
and parallel processing capabilities. 

During actual operation, a scheduler in the CVM can intelligentiy allocate tasks in real-time 
to computational resources in order to maintain optimal operation. This approach is 
referred to as *2 Phase Scheduling' in this specification. Because the resource requirements of 
different engines can be (^ explicitly modelled at design time and ^ intelligentiy utilised 
diiring runtime, it is possible to mix engines from several different vendors in a single 
product As noted above, these engines connect up to the Layer 1 control codes not 
direcdy, but instead through the intermediary of tiae CVM virtual machine layer. Further, 
efficient migration firom the non-real time prototype to a run time using a DSP and FPGA 
combination and then onto a custom ASIC is possible using the CVM. 
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The CVM is implemented with three key features: 

• Dynamic, multi-memory-space multiprocessor distributed scheduler with support for 
co-scheduling. 

• APIs to commonly used, high-MIPs operations for digital broadcast and 
communications, with architecture-native implementations. 

• Resource mmsLgement and normalisation layer provided over the native RTOS). 

The CVM can exist in several ^pipeline' forms. A 'pipeline' is a structure or set of 
interoperating hardware or software devices and routines which pass information from one 
device or process to anotiier. In die DSP environment, such pieces of information are often 
referred to as 'symbols'. Pipelines can be implemented also as data flow architectures as well 
as conventional procedural code and all such variants are within the scope of the present 
invention. The CVM can also be conceptualised and implemented as a statcmachine or as 
procedural code and again all such variants are within the scope of the present invention. 

One instance of the CVM contains an Interpreted Pipeline Manager, which incorporates 
run-time versions of the CVM core. By ^interpreted' we mean that its specification has not 
been translated into the underlying machine code, but is repeatedly re-translated as the 
program runs, in exactly the'same was as an interpreted language, such as BASIC. 

Another instance is an Instrumented Interpreted Pipeline Manager which incorporates run- 
time versions of the CVM core. This operates in the same was as an Interpreted Pipeline 
Manager, but also produces metrics and measiirements helpful to the developer. An 
interpreted non-instrumented version is also useful for development and debugging, as is a 
compiled and instrumented version. The latter may be die optimal tool for developing and 
debugging. 

Another version of the CVM is a Pipeline Builder. Instead of running, it outputs computer 
source code, such as C, which can be compiled to produce a Pipeline implementation. For 
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this reason it must have available to it CVM libmies. It can be thou^t of as the compiled 
and non-instrumented variant 

The CVM apparatus may include or relate to a standardised description of the characteristics 
5 0»icluding non-interface behaviour) of communications components to enable a simulator to 
accurately estimate die resource requirements of a system using those components. Time 
and concurrency restraints may be modelled in the CVM apparatus, enabling mapping onto a 
real time OS, with the possibility of parallel processing. 

10 Other features and aspects of the present invention are defined in the Claims of this 
specification. 



15 BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described with reference to the accompanying drawings in which: 

Figure 1 is a schematic showing die relationship between hardware and application software 
20 -wdien using Microsoft Windows; 

Figure 2 is a schematic showing the pre-Mcrosoft Windows relationship between hardware 
and application software; 

25 Figure 3 is a schematic showing die conventional failure to isolate supposedly architecturaUy 
neutral parts of a baseband stack; 

Figures 4A and 4B are schematics showing the successful isolation of architecturally neutral 
parts of a baseband stack in the present invention; 
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Figute 5 is a schematic showing the structure in a baseband communications stack; 

Figure 6 is a schematic showing the common engines and structure in an embodiment of 
the present invention; 

Figure 7 is a schematic showing the relationship between die CVM of the present invention, 
the hardware and the stack; 

Figures 8 and 9 are schematics showing steps in tihe development cycle using the present 
invention. 



DETAILED DESCRIPTION 

The present invention wiU be described witfi reference to the CVM implementation from 
RadioScape Limited of London, United Kingdom. 

CVM Overview 

The CVM is botii a platform for developing digital signal processing products and also a 
runtime for actually running those products. The CVM in essence brings the complexity 
management techniques associated with a virmal machine layer to real-time digital signal 
processing by (i) placing high MIPS digital signal processing computations (which may be 
implemented in an ardiitecture specific manner) into 'engines' on one side of the virtual 
machine layer and (ii) placing architecture neutral, low MIPS code (e.g. the Layer 1 code 
defining various low MIPS processes) on the other side. More specifically, the CVM 
separates all high complexity, but low-MIPs control plane and data 'operations and 
parameters' flow functionality from die high-iVIIPs 'engines' performing resource-intensive 
(e.g., Viterbi decoding, FFT, correlations, etc.). This separation enables complex 
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communications baseband stacks to be built in an 'architecture neutral*, highly portable 
manner since baseband stacks can be designed to run on the CVM, rather than the 
underlying hardware. The CVM presents a uniform set of APIs to the high complexity, low 
MIPS control codes of these stacks, allowing high MIPS engines to be re-used for many 
different kinds of stacks (e.g. a Viterbi decoding engine can be used for both a GSM and a 
UMTS stack). 

The virtual machine layer supports underlying high MIPs algorithms common to a number 
of different baseband processing algorithms, and makes these accessible to high level, 
architecture neutral, potentially high complexity but low-MIPs control flows through a 
scheduler interface, which allows the control flow to specify the algorithm to be executed, 
together with a set of resource constraint envelopes, relating to one or more of: time of 
execution, memory, interconnect bandwidth, inside of which the caller desires the execution 
to take place. 

During the development stage of a digital signal processing product, the MIPS requirements 
of various designs of the digital signal processing product can be simulated or modelled by 
the CVM in order to identify the arrangement which gives the optimal access cost (e.g. will 
perform with the m inimum number of processors); a resource allocation process is used for 
modelling which uses at least one stochastic, statistical distribution function (and/or a 
statistical measurement function), as opposed to a deterministic function. Simulations of 
various DSP chip and FPGA implementations are possible; placing high MIPS operations 
into FPGAs is highly desirable because of dieir speed and parallel processing capabilities. 

During actual operation, a scheduler in die CVM can intelligentiy allocate tasks in real-time 
to computational resources in order to maintain optimal operation. This approach is 
referred to as *2 Phase Scheduling" in this specification. Because the resource requirements of 
different engines can be (i) explicidy modelled at design time and (ii) intelligendy utilised 
during runtime, it is possible to mix engines from several different vendors in a single 
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product As noted above, these engines connect up to the Layer 1 control codes not 
directly, but instead through the intermediary of the CVM virtual machine layer. Further, 
efficient migration from the PCT non-real time prototype to a run time using a DSP and 
FPGA combination and then onto a custom ASIC is possible. 

The CVM is implemented with three key features: 

• Dynamic, multi-memory-space multiprocessor distributed scheduler with support for 
co-scheduling. 

• APIs to commonly used, high-MIPs operations for digital broadcast and 
communications, with architecture-native implementations. 

• Resource management and normalisation layer (provided over the native RTOS). 

The CVM is a design flow solution as well as a rtmtime 

The CVM provides a complete design flow to complement the runtime. This provides the 
engineer with fully integrated mathematical models, statistical simulation tools (essential for 
operation with bursty data), a priori partitioning simulation tools (to determine e.g., whether 
a datapath should go into hardware or run in software on a DSP core). Through the use of 
custom libraries for mathematical modelling tools (e.g. Madab / Simulink), the CVM is able 
to model in detail and with bit-exact accuracy the high-MIPs engine operations, allowing 
engineers to determine up front how many bits wide the various datapaths must be, etc. 
However, the system is also able to accept XML commands from a statistically simulated 
control plane, allowing birth/death events and burstiness to be handled within the context of 
the model. Furthermore, since even the simulation engines are accessed through the 
scheduler's indirection interface, it is possible to plug in calls to e.g. real hardware 
implementations to speed simulation execution. 

It is also, importantiy, possible to perform simulation of resource loading under various 
system partitioning decisions. How many instances of a particular algorithmic 'engine' (e.g., a 
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Viterbi decoder, a RAKE receiver element, a block FFT operation, etc.) are required to 
provide sufficient cover under various statistical loadings? What happens if a datapath is 
moved across a latent and/or contended resource such as a bus? What if the datapath is 
implemented in hardware rather tfian software? All of these decisions are critical but existing 
5 toolsets have not addressed them, and this is doubly true when the partitioning decisions are 
being made with respect to multiple, third-party IP engines or engines (see belovi). The CVM 
design flow explicidy enables these sorts of design decisions to be answered. Furthermore, 
initial partitioning information is then *fed forward' from die design toolset into the runtime 
scheduler, enabling it to vector requests off to the appropriate engine instances for 
10 implementation when the system is under actual dynamic load. 

Working jBrom the ^bottom up*, treating the software largely as an afterthought, is not longer 
a viable route to market; this path simply takes too long, yields a result that is too 
architecture-specific, and has a bad 'fif to die parallel, state-machine nature of the underlying 
15 domain. Working firom tiie 'top down', the paradigm utilised by the CVM, provides a much 
more powerftil and extensible solution. 

A final point about the CVM is that by separating out the control flow code firom the 
underlying engines, it becomes possible to perform a lot of development work on 
20 conventional platforms (e.g., PCs) without having to work with the actual embedded target 
This allows for much faster turnaround of designs than is generally possible when using a 
particular vendor's end target development platform. 

Example: The CVM is a design solution for hard real time, multi-vendor, multi- 
25 protocol environments such as SoC for 3G systems 

One of the core elements of tiie CVM is its ability to deal with (potentially conflicting) 
resource requirements of third party software/hardware in a hard real time, multi-vendor, 
multi-protocol environment. This ability is a key benefit of the CVM and is of particular 
importance when designing a system on chip ^oQ. To understand this, consider the 
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problems faced by a would-be provider of a baseband chip for the 3G cellular phone market 
First, because of the complexity of the layer 1 processing required, simply writing code for 
an off-the-shelf DSP is not an option; an ASIC will be required to handle the complexities of 
dispreading, turbo decoding, etc. Secondly, since UMTS will only be rolled out in a small 
number of metro locations initially, the chip will also need to be able to support GSM. It is 
unlikely that the company producing the baseband chip will have extensive skills in both 
these areas, therefore IP will need to be licensed in. This point becomes particularly relevant 
in light of the ever increasing time-to-market pressures for technology companies. But 
licensing in part-hardware, part-software IP engines from multiple vendors for layer 1 
provides a real problem. First, there is no current cocamon simple standard for 'mix and 
match' IP in this manner. What is needed, and what the CVM design flow provides, is a way 
to characterise both the static and dynamic resoxirce requirements of a 3"* party IP blocks so 
that it may be co-scheduled in real time with other IP engines, potentially from an entirely 
different supplier, and then connected transparently through to the higher level layer 1 
control code. Furthermore, the nature of the CVM is that these high-level overall call 
structures and control planes can be produced in an architecture-neutral language (e.g., SDL 
compiled to ANSI Q, with only the low-level, high-MIPs parts being implemented directly 
in an architecture-specific form. 

As noted above, the high MIPs functionality contained within the engines represent 
complete operational routines. These engines may be implemented in hardware or software 
or some combination of the two, but this is unimportant firom the point of view of the high 
level 'calling* code, which is entirely abstracted from the engines. The high-level IP 
communicates with the underlying engines via CVM scheduler calls, which allow the hard 
real-time dynamic resource constraints to be specified The scheduler then dispatches the 
request to the appropriate datapath for execution, which may involve calling a function on a 
DSP, or passing data to an FPGA or ASIC. Importanriy, the scheduler can deal with 
multiple hard datapaths that may have different access and execution profiles - for example, 
an on-bus Viterbi decoder, an on-chip software based decoder, and an off-chip dedicated 
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ASIC accessed via external DMA and pass particular requests off to the appropriate unit, 
which is completely independent from the calling highJevel code. 

This also means that, where two different communications stacks require some common 
high-MIPs engines, a vendor of an appropriate (platform-specific) engine implementation 
(whether designed in hardware, software, or some combination of both) can sell into both 
markets, and, if the two standards are implemented on a single SoC, both stacks can 
potentially share the same accelerator. In addition, die CVM specifies a set of over 100 core 
operations which taken together provide around 80% of the high-MIPs fimctionality found 
in the vast majority of digital broadcast and communications protocols. The CVM runtime 
also provides a wrapper around die underlying RTOS, presenting the high-level code with a 
normalised interface for resource management (including threads, memory, and external 
access). 

Using the CVM, it is possible to construct an integrated development platform for 
communications SoC products, in which a number of third party vendors are able to publish 
their IP, as either high-level architecture neutral SDL or C++ components, or architecture 
specific, resource profiled engines (which can be hardware, software, or a combination of 
both). An integrated design flow would enable the SoC designer to produce an overall 
system tiiat contains tiae appropriate engbes (chosen from particular vendors), add her own 
IP on botii or either side of die CVM, and dien generate both the deployable hardware 
specification (as a number of VHDL-defined cores, togetiaer with accelerators) and software 
components. It is possible to construct a toolset which would provide a complete flow 
dirough mathematical modelling, statistical a priori stochastic simulation for partitioning, 
protocol verification and final system generation and provide appropriate mechanisms to 
characterise, publish, enumerate and use libraries of 'packaged' IP within designs. 
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This system would have the potential to become die main workbench for SoC designers, 
who would only have to go into VHDL tools to develop the high-MIPs engiiies, not any of 
the layer 1 control fabric 

5 

The CVM allows SDL to be used in designing Layer 1 

As noted above, the CVM allows the low-MEPs code to be written in an architectural neutral 
manner, using either ANSI C++ or, preferably, SDL which may then be compiled to ANSI 
C. SDL is a language widely used within the teleconununication industry for the 

10 representation of layer 2 and layer 3 stacks, and is particularly well suited to systems that are 
most economically expressed in a state machine format. SDL traditionally would not be 
appropriate for use below layer 2 (the end of the 'soft real time* domain). The SDL code is 
entirely portable between various architectures, and may be tested in the normal manner 
using tools such as TTCN. System constraints (such as dynamic resource ceilings) can be 

15 attached to various portions of the code and substrate interconnects in development and 
then simulated with realistic loading models to allow up-front partitioning of the datapaths 
into hardware and software. Importantiy, the CVM schedule is cognisant of the datapath 
pardoning decisions taken during the design time portion of the development process. The 
toolflow is fuUy integrated with Matlab and Simulink, allowing bit-accurate testing of high- 

20 MIPs functionality. The use of SDL as the preferred language for the high-level logic flows 
within layer 1 is not accidental - SDL has been widely used within layers 2 and 3 of 
telecommunications stacks such as GSM, but has not crossed the chasm into the hard real 
time domain. With the CVM, by contrast, it becomes possible to invoke parallel, hard real 
time execution from SDL control flows, thereby allowing the extremely powerfiil and natural 

25 state machine expressiveness of SDL to be used to author the high level layer 1 algorithms. 
Increasingly, although low MIPs these algorithms are themselves extremely complex, as they 
must deal with issues such as bursty rate matching, user transport channel birth / death 
events, handovers between multiple standards, and QoS-bound graceful degradation under 
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load, to name but a few. Odier languages not designed for real-time operations (e.g. C++ 
and Java) can also be used in designing Layer 1, as alternative s to SDL. 

Theoretical background to the CVM 

Current digital communications systems are built around a largely common consensus, 
which has emerged in the last 15 years or so, about the best way to reliably transmit 
information wirelessly in the face of quite severe channel effects. Two-way systems have 
somewhat different channel and modulation requirements from broadcast-oriented systems 
(for example, using CDMA to provide graceful degradation in the face of a congested 
spectral band, and having some 'hard' real time requirements), but overall much 
commonality exists. 

For example, in the specific case of broadcast (one-way) systems, decoders and encoders 
may be seen as simply parallel 'protocol stacks'. Most broadcast transmission systems start 
with source coding (such as MPEG; tiiis compresses the input to reduce bitrate) followed by 
channel coding (such as convolutional and Reed-Solomon coding; this adds structured 
redundancy to improve the abiUty of the receiver to extract information despite signal 
corruption) followed by modulation (at which point a number of subcarners are modified in 
some combination of angle (frequency or phase) or amplitude to hold the information. The 
reverse process is then carried out in the receiver, yielding (on one level) the diagram of 
Figure 5. Hence, a set of common processing engines are found widiin almost all of the key 
digital broadcast systems, and a common processing structure may also be applied in each 
case. 

The CVM embodiment exploits this as follows: the co/j!;/;on en^nes, (or functions or libraries) 
include algorithms to perform one or more of the following: source coding, channel coding 
modulation, or their inverses, namely source decoding, channel decoding and demodulation. 
They include for example, the fast Fourier transform (FFI), Viterbi decoder (with various 
constraint lengths, Galois polynomials and puncturing vectors), Reed-Solomon engines, 
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discrete cosine transform (DCT) for the MPEG decoders, time and frequency bitwise re- 
ordering for error decoherence, complex vector multiplication and Euler synthesis, etc. A 
more extensive list is at Appendix 1. These are high MIPS routines and therefore ideally 
implemented in a CVM in an architecture specific manner (either through assembly code or 
5 hardware accelerators). They can, regardless of this, be accessed in the CVM through 
common, high level APIs. Each of these parameterised transforms has a parallel . 
mathematical modelling block provided for it. 

The common stmcUm involves splitting the decoding chain up into a symbol processing secdon 
10 (concerned with processing full symbols, regardless of whether all the information held 
within that symbol is to be used) and data directed processing, in which only those bits 
which hold relevant information are processed. In each case, it is critical that the processing 
modules are able to allocate, share and dispose of intermediate, aligned memory buffers, pass 
events between themselves, and exist within a framework that enables modular development . 
15 The common structure is paralleled where appropriate in a mathematical modelling 
environment and described via graph description language (GDL). Figure 6 schematically 
depicts this common block and stmcture approach used in the CVM. 

A s imila r analysis may be provided for 2-way systems, except that there is an additional CCS 
20 (calculus of concurrent systems) requirement and resource allocation issue, and the required 
'critical mass' of processing engines is slightiy different. 

It is interesting that current generation third party application development tools and 
hardware deployment platforms (DSPs and DSP cores) do not reflect the stmctural realities 
25 discussed above, and do not (on the whole) provide hardware acceleration tailored towards 
commvmications baseband applications nor the 2 phase scheduling approach (see below). 
Nor do current embedded operating systems support these operations in any systematic or 
coherent manner. 
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However, the number of digital commumcations systems is increasing rapidly, creating a 
demand for rapid time-to-market deployment of baseband stacks. As explained above, a core 
innovative approach of the present invention is to exploit the underlying commonality and 
requirements of such systems by providing a software-hosted common 'virtual machine 
5 layer* (exemplified by the CVM embodiment) reifying these capabilities and software 
structure. One key commercial application is as a design solution for hard real time, multi- 
vendor, multi-protocol environments such as SoC (as noted above). 



10 CVM Development Methodologies 

The development methodology used in the CVM builds upon (and departs from) a 
methodology using layered development and layered deployment. These concepts will be 
discussed initially: Layered development refers to a process of progressing ftoin mathematical 
models, through C++ or SDL code to a target assembler implementation (if necessary). 

15 Throughout this process, each of the modules in question is maintained at each of the 
necessary levels (for example, a convolutional decoder would exist as a parallel mathematical 
model, C++ implementation, SIMD model and assembler implementations in various target 
languages). 

20 Liffered deployment refers to the use of libraries to isolate the code as far as possible firom the 
underlying hardware and host operating system when a receiver stack is actually 
implemented. Hence as much as possible of the code (high complexity but low MIPs 
requirement) is kept as generic SDL or ANSI-compliant C++ which is then simply 
recompiled for the target platform. For example, a library is used to provide platform- 

25 dependent functions such as simple I/O, allocation of memory buffers etc. Another library 
is used to provide high-cycle routines (such as the FFT, Viterbi decoder, etc.) in an 
architecture specific manner, which may involve the use of highly crafted assembler routines 
or even callthroughs to specialised hardware acceleration engines. 
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These two libraries, no matter what the underlying hardware and operating system substrate, 
are manifest as a common API to the 'core' code, which therefore does not have to be 
modified during a port. The only code which does get modified, namely the contents of the 
library implementations, benefits firom significant encapsulation and a wide variety of test 
vectors generated firom the mathematical models. It is because the points of articulation in 
the architecture are appropriately positioned that porting of stacks can be rapidly achieved 
using this approach. 

Furthermore, as a development platform, this approach has the great advantage that one can 
develop on one architecture (e.g. the Intel platform) running not a mathematical model but 
ratiier a full, real-time transceiver, and then simply swap the libraries and recompile on the 
target architecture. This is very usefiil when trying to e.g., tune an equaliser module. 

The CVM approach builds on this way of working. However, in addition, as much as 
possible of the common functionality is abstracted into the 'virtual machine' hardware 
abstraction layer, together with key services and functions that are useful for all digital 
coflMnunications baseband processing work. 

Figure 7 below shows how this would work at an architectural level Instead of the given 
stack being shipped with different library implementations for platform A and platform B, in 
the CVM there is a common 'baseband operating system' layer for each of platform A and 
platform B, providing a common API on top of which (apart firom a recompile) the higher 
level code can run unchanged. 

Furthermore, we can incorporate into this layer much of the functionality that otherwise 
would lie within the C++ core, such as the symbol subscriber architecture for symbol- 
directed processing, and the pipeline architecture for data directed processing. 
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Specific CVM Development Methodologies: Two Phase Scheduling 
Phase I 

5 An important aspect when building a Baseband communications system is quantifying the 
requirements of the hardware and software platform the application will run on. A baseline 
calculation of the number of MIPs (millions of instructions per seconc^ an application will 
require is relatively straigjat forward, simply calculate the requirements of each component to 
perform one operation, multiply by the number of operations and add them all together. 

10 This, however does not take into account aspects like parallelism. Althoxigh, theoretically,. 2 
X 500 MIPs processors will deliver 1000 MIPs of processing power the algorithms may not 
be able to take advantage of this if the are waiting for operations on another chip to 
complete. There are also the extra processing requirements of the scheduler and the data 
transfer overheads to consider. The data transfer penalty is probably small if both processors 

15 are on the same board but more significant if they are on separate boards plugged into an 
external bus. Bus contention (two or more processors wanting to transfer data at the same 
time) can also reduce overall efficiency. 

The CVM provides a number of methods to facilitate implementing systems in this sort of 
distributed enviroximent, 

20 

Initially we can quantify the requirements of the individual computing components such as 
the signal processing jEiinctions described in Appendix 1 and the more application specific 
engines built upon them. In enviroimients like 3G mobile communications the amount of 
data passing though a block will vary' over time so it is not sufficient just to calculate the 
25 requirements of a block at one data rate. Instead a profile will be built up over the range of 
potential input vector sizes. 
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The CVM allows a system to be defined as a collection of data flows (pipelines) where data is 
injected at one end, and consumed at the other. The engines on these pipelines are 
characterised in terms of how much processing they require as a function of input vector 
size. The first pass at calculating the MIPs usage is to simulate passing engines of varying size 
along this pipeline and calculating the total usage as a fimction of input block size. This 
calculates the total MIPs requirements of the engines assuming they are run sequentially to 
completion on a single processor. 

A more sophisticated model then assigns engines to separate processors and allows true 
pipelining. A solution based on this architecture will require more MIPs than the single 
threaded solution but has the potential, once the pipeline is loaded, to process data engines 
in shorter elapsed time. If N is the number of processors, E(N) the efficiency of processor 
utilisation (1 = 100%, 0 = zero), Mp the MIPs rating of a single processor and M the total 
MIPs requirement of the problem then tiie time to process 1 seconds worth of data T will 
15 be; 

T = M/(E(N)xNxMp) 



10 



20 The objective is to find the smallest value of N where T is less than 1 by a "comfortable" 
margin. E(N) will be dose to 1 for a single board and will drop as the number of boards is 
increased (because of the overheads introduced by scheduling and data transfer). E(N) will 
also vary depending on how the processing engines are distributed between the boards 
(because of the varying data transfer requirements and the possibility of uneven load 

25 balancing leaving an processor idle some of the time). 
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A CVM simulator that has knowledge of the schediding process, Ae characteristics of the 
bus and the characteristics of the engines \wll be able to calculate E(N) and hence T for 
different numbers of boards and engine arrangements. It will also be possible to investigate 
die effects of "doubling up" some of die engmes; diat is having the same functionality on 
more than one board. 

Once we know the sequence of engines diat are required for a task we can set the CVM to 
search through arrangements of engines and boards looking for the optimal solution. It will 
also be possible to have individual Mp -ralues for the boards (replace N x Mp by the sum of 
die individual Mps) and to tie speciHc engines to specific boards, for instance a Viterbi 
decoder wiU ah^rays run on an FPGA, which will have a higher MIPs rating tiian a DSP. For 
large numbers of engines exhaustive searches will become impractical and some assistance 
from an engineer will be required. 

Phase II 

Once we have and acceptable arrangements of engines and boards we can move onto phase 
two of die scheduling process, "doing it for real". Phase I will have generated a system 
configuration which can no be used to load the engines onto the correct boards. This 
information will also be made available to die scheduler on the main board Once the system 
is running data engines will flow firom the scheduler to the engbes that will operate on diem. 
Most of the time this scheduler will simply send data onward in the order they need to be 
processed but there will be occasions when more intelligence can be applied. When tfiere are 
multq>le engines of equivalent priority the scheduler will look to try and balance the queue 
sizes on all die boards by scheduling work to die least loaded. When the same functionality 
exists on more than one board the scheduler will again look for the most appropriate board 
to schedule. All die boards will have a local scheduler to obviate die need to involve die 
main scheduler in routing engines between two engines on the same board. When there is a 
choice of board to send work to schedulers will always choose tiieir own board when 
possible. The scheduler will also have to monitor die absolute urgency of die most urgent 
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engines looking for potential lulls in the processing when it can schedule less urgent 
activities, such as routing log messages and monitoring information back to a monitoring 
console 

More CVM Development Methodologies: the MIPS Counter as used in a UMTS 
implementation 

As noted above, the CVM consists of a number of distributed engines that are connected 
and controlled by the CVM Scheduler. These engmes may sit on the same hardware, but 
could sit on different hardware (CPU, DSP or FPGA.) For a UMTS implementation of the 
CVM, a system to identify bottienecks and aid in serialisng die engines/blocks has been 
developed. We first assume diat the processing route for a block of data is given; for 
instance the UMTS standards 25.212 and 25.222 suggest how the block is muxed in the 
TrCH stage. Some of the processing may then be switched between routes depending on 
some objective criteria such as BER, However, the required engmes are known. Then, the 
order of the engine must be determined in terms of die data size and number of users. For 
example, if a vector is of length n, and if die engine consists of for (int i=04 <n, i++) 
{ 

for (intj=0,j <n,j++) 
{ 

//Do something... 

} 
} 

then we can say that the process is an order n'^, or o(n'^2). Next we can count die number 
of operations (+\ in ( //Do sometiiing'). FFTs are for example n Log (n) processes. 
We can then multiply diis by die device's instructions per operation and then divide diis by 
die number of MIPS to get die time diat die device wiU take to perform a task Alternatively 
we can simply set a relative time. 
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The same process can be repeated for the number of users (K): for example MU can go as 
2^IC Finally, each block may or may not change the bit rate. Turbo Encoding increases it 
multiplicatively by a factor of 3.m CRC adds 12 bits. (Note, that bus latency, the scheduler, 
paraUelisation/ serialisation can all be considered to be engines). 

The point is that we know that data rate. The question answered by this process is how we 
can distribute the engmes (e.g. their MIPS budget) to accommodate this. 

TopDownDesign 

Traversing the processing chain is quite complex when state and data control are needed. 
This procedure is used to tie in RS C++ blocks dirough a standard adaptor to mtegrate with 
Simulink. Fundamentally, the intention is to move through hierarchies. As you move up 
layers, so the abstraction becomes higher and higher. The intention is to round trip data a 
'user* creates 3 services: The UE Tx this to the BS through a physical channel with certain 
properties. The BS receives and decodes the data. In this case the BS has a trivial backhaul, 
and retransmits the data back to the UE, through a physical channel, whereupon the data is 
compared to the input data. This system allows us to interchange engines to improve 
performance in terms of BER and time in a variety of chaimels. 

CVM Features 

The CVM can be thougjht of as a minimal OS to provide the sorts of functionality required by 
baseband processing stacks (and, as mentioned, these can be two-way stacks also, such as 
GSM or Bluetooth). It is therefore complementary to a full-blown embedded operating 
system like Mcrosoft Windows CE or Symbian's EPOC. 

The CVM provides {inter aJid) the following functionality: 

• Extensive set of vector-processing primitives (more completely listed at Appendix 1), 
covering operations such as FFTs, FIR and UR and wave digital filters, decimation. 
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correlation, complex multiplication, etc. These should use hardware acceleration where 
this is available on the underlying hardware, and would be accessed via a set of library 
calls paralleling an extended version of a library. In a sense, this aspect of the CVM 
represents a software or API abstraction of an idealised digital signal processing engine 
for digital communications. 

• Support for allocation of aligned buffers and memory 'handshaking' (ping-pong buffers). 

• Advanced scheduling management, with the option for pre-emptive mviltithreading of a 
simple kind Hard real-time performance (i.e., the ability to guarantee that a piece of code 
will execute at a particular point in time) will be supported as a key component of the 
architecture. Inter-process communication structures (at least shared memory) and 
thread synchronisation facilities will be provided. A key feature is a stochastic parallel 
scheduler, cognisant of design time partioning decisions for CVM engines across a 
heterogenous computational substrate. 

Explicit support for die notion of symbol and data directed processing. This will directiy 
support the ability to add symbol subscribers and pipeline stages into the structure to 
allow modular development. 

Support for key I/O peripherals, including serial ports, parallel ports and display 
controllers. 

Extensibility to enable the scope of the O/S to be increased, particularly for modular 
I/O support. 

Characterisation libraries for a particular implementation, allowing mathematical models 
and real-time prototypes to mimic the performance of the target substrate and 
interconnects to a high degree of accuracy, 
PC versions to enable the production of real-time prototypes. 

Support for commimication with a host (application) OS - tiiis will be bi-directional to 
enable callbacks and so on. A component intercommunication technology (e.g. COM) 
may be used to provide the binary 'glue'. A suitable application OS tnight be, for 
example, EPOC32 or Windows CE, as tiiese are OSs designed to perform the more 
usual user-level 1/ O and structured storage management 
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• Ability to 'pare doW the ROM image of the CVM at build time to ensure that the 
minimum ROM (hence, ultimately, chip area) is used. This uses a minimal 
implementation of the CVM. 

• State machine functionality management (including potential integration with SDL) 

• Support for data structures 

• Transfomis between different representations (such as fixed and floating point). 

The goal of the CVM is to enable the rapid deployment of particular (plications onto particular 
targets^ with the multiplicity of applications coming at the development stage. Conventional OSs 
are designed for run-time support of a variety of apps that are essentially unknown when the 
OS is loaded, but this is typically not the case with the CVM. Moreover, the CVM does not 
need to handle interaction with a user, except by supporting presentation streams through 
portals provided by the Tiost' OS. 

The CVM incorporates a number of the features that are airrentiy in the high-level C++ 
code of a DAB stack into the infrastmcture level (such as the appropriate modular structure 
for the development of symbol-directed and data-directed processing), and is not simply a 
'library wrapper*. 

The CVM concept rests upon the idea (critically dependent upon domain knowledge that 
can only be achieved through review of the various standards and the process of actually 
building the stacks) that abstracting the common functions and (importandy) processing 
structures required by modern digital broadcast and communications standards is possible 
and can be achieved elegantiy through an appropriate software abstraction layer coupled with 
a systematic layered development environment 

CVM Advantages 

With the CVM, stack developers are isolated jBrom the particular hard\rare in use. The CVM 
provides support for the structures (e.g., symbol and data-directed pipelines, and state 
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machines), functions (e.g., memory allocation and real time resource and concurrency 
management) and libraries (e.g., for FFT, Viterbi, convolution, etc.) required by digital 
comrnxmication baseband stacks to enable code to be written once, in a high-level language 
(SDL, ANSI C/ C++ or Java) and merely recompiled (if necessary, with Java it would not be, 
5 and COM or some other form of component intercommunication technology can provide 
the 'binary level* glue to link the modules together) to run on a particular platform, making 
calls through to the hardware abstraction layer provided by the CVM layer. 

Prototyping using die CVM will be very rapid, with each of the DSP modules paralleled by a 

10 mathematical model Memory allocation and partitioning will be supported by an automated 
toolset (parameterised by the desired target hardware) rather than relying on guesswork. 
Once the processing chain is established on the model (which will optionally be performed 
by graphical arrangement and parameterisation rather than coding and is working 
successfully, it will be possiblevto mn a real-time PC-based version (using the Intel 

15 MMX/SIMD version of the CVM, together with RadioScape's generic baseband processor 
module). Any changes to die standard code (e.g. a custom equaliser) may then be integrated 
in a modular, incremental fashion and the code-test-edit cycle (being PC basec^ could use all 
the latest PC development tools, and be very rapid. Use of hardware acceleration on the 
target platform will be covered by the CVM (since all of the required cycle-intensive features 

20 for digital communications baseband processing will be provided as library calls at the CVM 
API). Clearly, the use of an appropriately adapted underlying hardware unit, would provide 
targeted acceleration for most of the desired functions. For many applications, the support 
of lightweight pre-emptive multithreading and other low-level functions on the CVM itself 
will obviate the need to use any other RTOS, but interaction with a user-OS (such as 

25 Windows CE or Symbian's EPOQ will be supported and straightforward through the APIs 
discussed above. 

With this approach, a CVM-compatible stack, once written, would be portable instantiy to 
any of the hardware platforms onto which the CVM itself had been ported, (always 
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providing, of cotirse, that there were sufficient resources (MIPs, memory, bandwidth) on the 
target machine to execute die desired stack in real time) without invoking extra work This 
would represent a substantial market opportunity (assuming reasonable ctoss-platform 
penetration of the CVM) for stack vendors, as it will essentially insulate their developments 
from hardware specificity. There is also a particxilady significant commercial opportunity for 
designing multi-vendor SoC products (see above). 

From the hardware vendor's point of view, the advantage of the CVM is that once it is 
ported for a given processor, that processor would automatically support (resources 
permitting) all stacks tiiat had been written to die CVM APL This, of course, obviates the 
need for die hardware provider to get into die applications business; diey need only port the 
CVM. It also means that the need to produce and support a fiiU-specification development 
environment and toolset is reduced, since stack vendors (for the digital communications 
market at least) would tiien be able to develop code purely in ANSI C/C++ or Java. It 
should be noted that die CVM concept does not apply to all digital signal processing tasks, 
for example, making a PID controller for use in a car braking system. The reason that the 
CVM concept works for digital communication baseband processing is that, as explained 
above, there is a large pool of commonality in such systems that can be exploited; however, 
die CVM does not provide all die tools, structures or functions that would be required for 
otiier digital signal processing tasks, necessarily. Of course, it would potentially be possible 
to identify other such 'islands' of common function and extend Ae CVM idiom to cover 
daeir needs, but we are focussed here on die baseband aspects because they are highly in 
demand, and strongly exhibit the necessary commonality. The CVM approach leaves the 
hardware vendor firee to compete not on the existing application set, but rather on die 
virtues of dieir hardware (e.g., MIPs, targeted acceleration, memory, power consumption). 

The CVM Development Cycle 

The process of actually using the CVM to develop a baseband stack will now be described. 
For the purposes of dus specification, a device is the taiget being developed, such as a digital 
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radio. A component is an identifiable specific part of it: either software, hardware, or both. 
Interpreted' means code (possibly compilec^ which reads in configurations at run time. 

The CVM Development Cycle begins with the ^Component Definition Language'. This 
5 language enables the full externally visible attributes of a component to be specified, as well 
as its behaviour. The intention is that this can be written by a manufacturer or (as will be 
seen later) could be generated by test runs of an instrumented CVM. 

Via a set of plug-ins the Component Definition Language can be read in to a mathematical 
10 modelling tool, such as the industry popular MatLab or Mathematica. Using the modelling 
tool, the theoretical behaviour of all components to be used in the device would be explored 
and understood. 

The results of this investigation would then be either transcribed, or output via another plug 
15 in to be developed, into 'Device Definition Language'. Just as Component Definition 
Language defines a component, this defines the target device being built, and will contain 
such elements as which components are xased. 

In effect, the Device Definition Language defines the communications Tipeline* that is being 
20 developed. The Pipeline concept is important since most communications devices can be 
thought of as the process of moving information through a pipeline, performing transforms 
on the way. It is in effect an electronic assembly line, but rather than operate on parts of a 
car, it operates on items of data commonly called 'symbols*. Thus a radio signal would 
eventually be transformed to an audio signal. Of course, 'real' devices are often more 
25 complicated than a simple pipeline, and may have more than one pipeline, branches, or 
loops. The CVM development process allows a pipeline design to be tested before a full 
hardware version is ever built. This leads to shorter development times. 
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To fully define a target device, or pipeline, more information is needed. We also need a 
description of die resources (such as CPU rate) available on our target, and this is defined in 
a 'Conformance Scripting Language' and interconnects. We also need to know how each 
component is used (both physical and software APIs); this is achieved using 'Component 
API Specifications'. 

These three resources: the Device Definition Language, the Conformance Scripting 
Language, and the Component API Specifications, are now used within one of several 
possible CVMs: The first is the Instrumented Interpreted' (or, preferably. Instrumented and 
Compiled, which wiU perform more rapidly than an Instrumented Interpreted version) 
Pipeline Manager. This has some similarity to a software ICE. It reads die three resources 
and dien emulates the pipeline (emulation may be in real time); so if the target is a radio it 
then runs as a radio. Because of the Conformance Scripting Language it is able to simulate ' 
any botdenecks or resource limitations that would exist on the target device and is useful for 
development and de-bugging. In addition to running, die Instrumented Interpreted/ or 
Instrumented Compiled Pipeline Manager also outputs diagnostic information for each 
device - in Component Definition Language. This is important, since it can now be fed back 
into the development cycle and merged widi the original Component Definition Language 
descriptions to refine tiiat description. Hence, information on actual performance is made 
available to the designer before any hardware is constructed, and this is where the 
(substantial) development savings are made. This closes the inner loop of die development 
cycle. The Instrumented Interpreted or Instrumented Compiled Pipeline Manager 
incorporates run-time versions of die CVM core. It is possible for software elements of die 
Instrumented Interpreted or Instrumented Compiled Pipeline Manager to be replaced by 
hardware versions. (Ideally one at a time, so that bugs can be detected as they are 
introduced.) This is another development process enhancement This corresponds to the 2 
Phase Scheduling process (see above) involving die design time portioning of engmes across 
the computational substrate. 
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The second CVM is an Interpreted Pipeline Manager*. It is not instrumented, but in other 
regards is identical It may be used in development and debugging and by a manufacturer to 
produce a complete product. This is the third benefit: much of the work in writing a 
communications device is already done. It also incorporates run-time versions of the CVM 
core. 

The third CVM is a Tipeline Builder*. It can be thought of as a Compiled Non- 
Instrumented variant Like die other two it reads the diree resources, but instead of running 
it outputs computer source code, such as C, which can be compiled to produce a Pipeline 
implementation. For this reason it must have available to it CVM libraries. Testing this 
closes the outer loop of the development cycle. The overall approach of the CVM 
development cycle is shown schematically at Figures 8 and 9. 

In die prior art section of this specification, we acknowledged the eXpressDSP Real-Time 
Software Technology firom Texas Instruments Incorporated. The key advances possessed 
by the CVM will now be apparent to the skilled implementer. They include the following: 

• EXpressDSP is not a virtual machine layer as such. 

• CVM allows portability between various DSP platforms simply by porting the virtual 
machine; it is not tied to one platform (as the TI system is) 

• CVM includes integration with mathematical modelling 

• CVM allows the development of stacks using PC-based tools, not the less capable DSP- . 
based tools 

• CVM includes the ability to 'real time' prototype on the PC, moving module-by-module 
onto the target environment 

• CVM includes the ability to generate resource timings by running a standard code set, 
and then generate an 'architecture description' profile firom this 

• CVM allows development using high-level languages, since most of the 'high cycle' 
routines are akeady 'in die environment' of the CVM, which is oriented towards the 
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signal processing requirements of baseband communication engines rather than a genetic 
'real time software foundation' 

• CVM also includes the sort of data, dynamic resovirce, and buffer management 
commonly reqiiired for baseband DSP 

• CVM gives provision for a-priori resource prediction and concurrency analysis 

• CVM includes not merely functional elements (an API) but also the call stmcture (how 
the DSP code functions dynan[iically) as well as the full development paradigm support 
(from mathematical modellings resource modelling, through PC-based prototyping and 
finally end-target deployment) 

CVM allows the use of a third-party RTOS if desired, and can also operate without an 
RTOS if desired. 
CVM offers 2 Phase schediiling 

CVM enables advantages in migrating to ASICs and SoCs 

CVM offers runtime and design tools which are fully integrated yet platform 
independent 
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Appendix 1 

Examples of Cote Processes 

5 Signal Transforms and Frequency Domain Analysis 

• Signal Flow Graphs (SFG) 

• Discrete Frequency DFT 

• Windowing (Hamming, Manning etc.) 

10 Digital Filtering 

• Digital FIR Filters 

• Impulse Response 

• Frequency Response 

• FIR Low Pass Digital Filter 

15 • Infinite Impulse Response Digital Filters 

Adaptive Signal Processing 

• Components for Adaptive Signal Processing including Adaptive Digital Filters 

• Chaimel Identification 
20 • Echo Cancellation 

• Acoustic Echo Cancellation 

• Background Noise Suppression 

• Channel Equalisation 

• Adaptive Line Enhancement (ALE) 
25 • Adaptive Algorithms, including: 

• Minimising the Mean Squared Error 

• Adaptive Algorithm for FIR Filter 

• Mean Squared Error 

• Minimum Mean Squared Error Solution 
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• Wienex-Hopf Solution 

• Gradient Techniques 1 

• Gradient Techniques 2 

• The LMS Algorithm 

• Recursive Least Squares 

• Adaptive IIR Filtering 

• Gradient ICR. Filtering Techniques 

• Feintuch'sHRLMS 

• Equation Error LMS Algorithm 

• Directed Mode pDM) 

• Subband Adaptive Filter (SAF) Stmcture 

Multirate Signal Processing 

• Upsampling & Downsampling 

• Interpolating Low Pass Filter 

• Oversampling and Reconstrunction 

• Sigma-Ddta Processing Architecture 

• Subband Processing 

• M-Channel Filter Banks by Iteration 

• Modulated Filter Banks 

• Polyphase Filter Banks 

• QMF Filter Banks 

Audio Signal Source Coding 

• Lossless Huffman Coding/Decoding 

• Linear PCM 

• Companding 

• Adaptive Quantization Tools 

• Linear Predictive Coding 
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• Long-Term Prediction 

• Delta Modukrion (DM) 

• Differential PCM pPCM) 

• Adaptive DPCM (ADPCM) 
5 • LPC Vocoder 

• Code-Excited Linear Prediction (CELP) 

• Algebraic CELP (ACELP) 

• Subband Coding 

• Tools for Psychoacoustics 
10 • Spectral Masking 

• Temporal Masking 

• Precision Adaptive Subband Coding and bit Allocation and bit Stream Formatting tools 

Digital Modulation 

15 • XOR long an short code spreading/despreading 

• Amplitude Modulation 

• Quadrature Amplitude Modulation (QAM) 

• Quadrature Demodulation 

• Complex Quadrature Modulation 
20 • Complex Quadrature Demodulation 

• QPSK 

• n-PSK 

• M~ary Amplitude Shift Keying 

• 7t/nQPSK 

25 • Unipolar RZ and NRZ Signalling 

• Polar and Bipolar RZ and NRZ Signalling 

• Bandpass Shift Keying, including 

• Amplitude (On-OfQ Shift Keying 

• Binary Phase Shift Keying (BPSK) 
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• Frequency Shift Keying including 

• Bandpass Filtering for BPSK 

• Pulse Shaping including 

• Nyquist (Sine) Pulse Shaping 

• Raised Cosine Pulse Shaping 

• Root Raised Cosine Pulse Shaping 

Spread Spectrum Tools 

• Pseudo Random Code Generation 

• Gold Sequences 

• Kasaroi Sequences 

• Orthogonal Spreading Codes 

• Variable Length OC Generation 

• Orthogonal Walsh codes 

• Code Detection 

• Rake Receiver implementing 

• NBI Rejection Techniques including 

• Prediction filters 

• NBI rejection in Transform Domain 

• Decision feedback NBI rejection 

Tools for Management of Multiple Access & Detection 

• TDMA including 

• TDMA Frames 

• TDMA combined witii FDMA 

• CDMA, including 

• Direct Sequence (DS) CDMA 

• Power Control 

• Beamforming Tools 
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• Frequency Hopping CDMA 

• Multiuser Detection (MUD) 

• Multiple Access Interference Suppression 

• Decorrelator 

• Interference canceller 

• Adaptive MMSE 

• MMSE receiver training 

• Adaptive MMSE receiver DDM 



Mobile Channels 

• Rayleigh Fading Suppression mechanisms (Gaussian, Riceian) 

• Modelling and suppression tools, including: 

• Time spreading 

• Time spreading: coherence bandwidth 

• Time spreading: flat fading 

• Time spreading: Freq selective fading 

• Time variant behaviour of the channel 

• Doppler effect 



Channel Coding 

• Cyclic Coder 

• Reed Solomon Encoder 

• Convolutional Encoder 

• CE Puncturing 

• Interleaving 

• Convolutional Decoder 

• Viterbi Decoder (Hard and soft decision) 

• Turbo Codes 

• Turbo Encoding 
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• Turbo Decoding 
Equalisation 

• Adaptive Channel Equalisation 

• FIR Equaliser 

• Decision Feedback Equaliser 

• Direct conversion toolkit 

• QAM Analog RF/IF Architecture 

• QAM IF Downconversion support 

• Bandpass Sigma Delta support 

• Bandpass Sigma Delta to Baseband support 

• Bandpass and fs/4 Systems 

Signal Processing Library Functions 

This section describes some of the signal processing functions available with the CVM 

• Vector Manipulatbn Functions 

Estimates a normal, biased or unbiased auto-correlation of an input 
vector and stores die result in a second vector 
Computes tiie complex conjugate of a vector, the result can be 
returned in place or in a second vector. 
Returns the conjugate of a complex value. 

Computes the conjugate-symmetric extension of a vector in-place or 
in a new vector. 



AutoCorrelate 

Conjugate (vector) 

Conjugate (value) 
ExtendedConjugate 
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Exp 

InverseThreshold 

Threshold 

CrossCorrekte 

DotProduct 

ExtendedDotProd 

DownSample 

Max, 

Mean 

Min 

UpSample 

PowerSpectrum (1) 
PowerSpectrum (2) 



Add 
Subtract 
Multiply 
Divide 



Computes a vector where each element is e to the power of the 
corresponding element in the input vector. The result can be 
returned in place or in a second vector. 

Computes the inverse of the elements of a vector, with a threshold 
value. The result can be returned in place or in a second vector. 
Performs the threshold operation on a vector. The result can be 
returned in place or in a second vector. 

Estimates the cross-correlation of two vectors and stores the result 
in a tiaird vector. 

Computes a dot product of two vectors after applying the 

ExtendedConjucate operation to them. 

Computes a dot product of two conjugate-symmetric extended 

vectors. 

Down-samples a signal, conceptually decreasing its sampling rate by 

an integer factor. Returns the result in a second vector. 

Returns the maximum value in a vector. 

Computes the mean (average) of the elements in a vector. 

Returns the minimum value in a vector. 

Up-samples a signal, conceptually increasing its sampling rate by an 

integer factor. Returns the result in a second vector. 

Returns the power spectrum of a complex vector in a second vector. 

Computes the power spectrum of a complex vector whose real and 

imaginary components are two vectors. Stores the results in a third 

vector. 

Adds two vectors and stores the result in a third. 
Subtracts one vector from another and stores the result in a third. 
Multiplies two veaors and stores the result in a third. 
Divides one vector by another and stores the result in a third. 
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• Complex Vector Operations 



ImaginaryPart 
RealPart 
Magnitude (1) 

Magnitude (2) 



Phase (1) 
Phase (2) 



ComplexToPolar 



ComplexToPokr 
PolarToComplex 
PoIarToComplex 



Returns the imaginary part of a complex vector in a second vector. 
Returns the real part of a complex vector in a second vector. 
Computes the magnitudes of elements of a complex vector and 
stores the result in a second vector. 

This second version calculates the magnitudes of elements of the 
complex vector whose real and imaginary components are specified 
in individual real vectors and stores the result in a third vector. 
Returns the phase angles of elements of a complex vector in a 
second vector. 

Computes the phase angles of elements of the complex input vector 
whose real and imaginary components are specified in real and 
imaginary vectors, respectively. The function stores the resulting 
phase angles in a third vector. 

Converts the complex real/imaginary (Cartesian coordinate X/Y) 
pairs of individual input vectors to polar coordinate form. One 
version stores the magnitude (radius) component of each element in 
one vector and the phase (angle) component of each element in 
another vector. 

A second version returns the polar co-ordinates as (magnimde, 
phase) pairs in a singjie vector 

Converts the polar form (magnitude, phase) pairs stored in a vector 
into a complex vector. Returned in a second vector. 
Converts the polar form magnitude/phase pairs stored in tiie 
individual vectors into a complex vector. The function stores the 
real component of the result in a third vector and the imaginary 
component in a fourth vector. 
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PolarToComplex Convexts the polar fonn magnitude/phase pairs stored in two 

individual vectors into a complex vector. The function stores the 
real component of the result in a third vector and the imagmary 
component in a fourth vector. 



5 • Sample quantisation 

These methods convert between linear and nonlinear quantisation schemes. The number of 
bits used and the non linear parameters used can be varied. 

ALawToLinear Converts a vector of A-law encoded samples to linear samples. The 

result can be returned in place or in a second vector. 
10 linearToALaw Encodes a vector of linear samples using the A-law format. The 

result can be returned in place or in a second vector. 
LinearToMuLaw Encodes the linear samples ia a vector using the fi-law . The result 

can be returned in place or in a second vector. 
MviLawToLinear Converts a vector of 8-bit Ji-law encoded samples to the linear 
15 format. The result can be returned in place or in a second vector. 



• Sample-Generating Functiotis 



RandomGaussian Computes a vector of pseudo-random samples with a Gaussian 

distribution. 

Ini ri al is eTone Initialises a sinusoid generator with a given frequency, phase and 

20 magnitude. 

NextTone Produces the next sample of a sinusoid of frequency, phase and 

magnitude specified using InirialiseTone. 
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InitdaliseTriangle Initialises a triangle wave generator -with a given frequency, phase 

and magnitude. 

NextTriangle Produces the next sample of a triangle wave generated using the 

parameters in InitiaUseTriangjie. 



• Vl^dowing Functions 



BartlettWindow Multiplies a vector by a Bardett windowing function. The result is 

returned in a second vector. 

BlackmanWindow Multiplies a vector by a Blackman windowing function with a user- 
specified parameter. The result is returned in a second vector. 

HammingWindow Multiplies a vector by a Hamming windowing function. The result is 

returned in a second vector, 

HannWindow Multiplies a vector by a Hann windowing function. The result is 

returned in a second vector. 

KaiserWndow Multiplies a vector by a Kaiser windowing function. The result is 

returned in a second vector. 



• Convolution Functions 



Convolve Performs finite, linear convolution of two sequences. 

Convolve2D Performs finite, linear convolution of two two-dimensional signals. 

Filter2D Filters a two-dimensional signal similar to Convolve2D, but with the 

input and output arrays of the same size. 
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• Fourier Transfotm Functions 



Versions of these 
integer) formats. 
DiscreteFT 

InitialiseGoertz 
ResetGoertz 
Goert2FT(l) 
Goert2FT(2) 

FFT(l) 

FFr(2) 

FFT(3) 

FFT(4) 

FFT(5) 

IFFr(l) 
IFFr(2) 
IFFT(3) 



methods exist for a number of different data storage (fixed, floating and 

Computes a discrete Fourier transform in-place or in a second 
vector. 

Initialises the data used by Goert2el functions. 
Resets die internal delay line used by the Goertzel functions. 
Computes the DPT for a given frequency for a single signal count 
Computes the DPT for a given frequency for a block of successive 
signal counts. 

Computes a complex Fast Fourier Transform of a vector, either in- 
place or in a new vector. 

Computes a forward Fast Fourier Transform of two conjugate- 
symmetric signals, either in-place or in a new vector. 
Computes a forward Fast Fourier Transform of a conjugate- 
symmetric signal, either in-place or in a new vector. 
Computes a Fast Foxxrier Transform of a complex vector and 
returns the result in two separate (real and imaginary) vectors. 
Computes a Fast Fourier Transform of a complex vector provided 
as two separate (real and imagmary) vectors returns the result in two 
separate (teal and imaginary) vectors. 

Computes an inverse Fast Fourier Transform of a vector, either in- 
place or in a new vector. 

Computes an inverse Fast Fourier Transform of two conjugate- 
symmetric signals, either in-place or in a new vector. 
Computes an inverse Fast Fourier Transform of a conjugate- 
symmetric signal, either in-place or in a new vector. 
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• Finite Impulse Response Filter Functions 



InitialiseFIR 



FIR 



BlockFIR 



GetFERDelays 



10 GetFIRTaps 



SetFIRDelays 



15 



SetFIRTaps 

InitisliseMultiFIR 
MultiFIR . 

BIockMultiFIR 



20 



Initialises a low-level, single-rate finite impulse response filter with a 
set of delay line values and taps. 

Filters a single sample through a low-level, finite impulse response 
filter, previously configured using InitialiseFIR. 
Filters a block of samples through a low-level, finite impulse 
response filter. 

Gets the delay line values for a low-level, finite impulse response 
filter. 

Gets the tap coefficients for a low-level, finite impulse response 
filter. 

Changes the delay line values for a low-level, finite impulse response 
filter. 

Changes the tap coefficients for a low-level, finite impulse response 
filter. 

InitialLses a low-level, multi-rate finite impulse response filter. 
Filters a single sample through a low-level, multi-rate finite impulse 
response filter, previously configured using InitisliseMultiFIR. 
Filters a block of samples througjh a low-level, multi-rate finite 
impulse response filter, previously configured using 
InitisliseMultiFIR. 



• Least Mean Squares Adaptation Filter Functions 

InitialiseSALF Initial i s e a low-level, single-rate, adaptive FIR filter that uses the 

least mean squares (IMS) algorithm. 
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InitialiseMALF Initialise a low-level, multi-rate, adaptive FIR filter that uses the least 

mean squares (LMS) algorithm. 
InitALFDelay Initialises a delay line for a low-level, adaptive FIR filter that uses 

the least mean squares(LMS) algorithm. 
SALF Filter a sample through a low-level, single-rate, adaptive FIR filter 

that uses the least mean squares (LMS) algorithm. 
MALF Filter a sample through a low-level, multi-rate, adaptive FIR filter 

that uses the least mean squares (LMS) algorithm. 
SLF Filter a sample through a low-level, single-rate, adaptive FIR filter 

that uses the least mean squares (LMS) algorithm, but without 

adapting the filter for a secondary signal. 
MLF Filter a sample through a low-level, multi-rate, adaptive FIR filter 

that uses the least mean squares (LMS) algorithm, but without 

adapting the filter for a secondary signal. 
EnginesALF Filter a block of samples through a low-level, single-rate, adaptive 

FIR filter that uses the least mean squares (LMS) algorithm. 
BlockMALF Filter a block of samples through a low-level, multi-rate, adaptive 

FIR filter that uses the least mean squares (LMS) algorithm. 
EnginesLF Filter a block of samples through a low-level, singje-rate, adaptive 

FIR filter that uses the least mean squares (LMS) algorithm, but 

without adapting the filter for a secondary signal. 
BlockMLF Filter a block of samples through a low-level, multi-rate, adaptive 

FIR filter tiiat uses the least mean squares (LMS) algorithm, but 

without adapting the filter for a secondary signal. 
SetALFDelays Sets the delay line values for a low-level, adaptive FIR filter that uses 

the least mean squares (LMS) algorithm. 
SetALFLeaks Sets the leak values for a low-level, adaptive FIR filter that uses the 

least mean squares (LMS) algorithm. 
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SetALFSteps 

SetALFTaps 

GetALFDelays 

GetALFLeaks 

GetALFSteps 

GetALFTaps 



Sets the step values for a low-level, adaptive FIR Biter that uses he 
least mean squares (LMS) algorithm. 

Sets the taps coefficients for a low-level, adaptive FIR filter that uses 
the least mean squares (LMS) algorithro. 

Gets die delay line values for a low-level, adaptive FIR filter that 
uses the least mean squares (LMS) algorithm. 
Gets the leak values for a low-level, adaptive FIR filter that uses the 
least mean squares (LMS) algorithm. 

Gets the step values for a low-level, adaptive FIR filter that uses he 
least mean squares (LMS) algorithm. 

Gets the taps coefficients for a low-level, adaptive FIR filter that 
uses the least mean squares (LMS) algorithm. 



• Infibtiite Impulse Response Filter Functions 



InitialiseUR 

InitiaUseBiquadllR 

InitialiselERDelay 

nR 

BlockllR 



Initialises a low-level, infinite, impulse response filter of a specified 
order. 

Initialises a low-level, infinite impulse response (CDR) filter to 
reference a cascade of biquads (second-order IIR sections). 
Initialises die delay line for a low-level, infinite impulse response 
(IIR) filter. 

Filters a single sample through a low-levd, infinite impulse response 
filter. 

Filters a block of samples through a low-level, infinite impulse 
response filter. 
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• Wavelet Functions 

DecomposeWavelet Decomposes signals into wavelet series. 
ReconstructWavelet Reconstructs signals from wavelet decomposition. 

• Discrete Cosine Transform Function 

I^CT Performs the Discrete Cosine Transform (DCI). 

• Vector Data Conversion Functions 

All the functions described in this section can operate on a number of different data formats 
(such as various integer lengths, different floating point formats and fixed point 
representations of floating point numbers). The Signal Processing library will contain 
metiiods to translate single values and vectors between all pairs of formats supported. 
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CLAIMS 



1. A method of designings modelling or fabjicating a communications baseband stack, 
comprising the steps o£ 

(a) creating a description of one or more of the following parameters of the 
baseband stack: 

(i) resource requirements; 
(n) capabilities; 
Qii) ' behaviour; and 

(b) using diat description as an input to software comprising a virtual machine 
layer optimised for a communications DSP in order to generate an emulation of the 
baseband stack to be designed, modelled or fabricated.. 



The method of Claim 1 comprising the steps of: 

(a) using, for one or more components to be incorporated in die baseband stack, 
a component description which deOnes some or aU of die externally visible attributes 
of a component, as well as its behaviour, as an input to a matiiematical modelling 
tool programmed to output component related performance data for each 
component; 

(b) processing the component related performance data for each component to 
yield a baseband slack description; 

(c) creating a resources description defining die resources of die baseband stack; 

(d) creating an interfece description defining how each component is to be used 
in the baseband stack; and 

(e) using each of the baseband stack description, the resources description, and 
the interface description as die inputs to the software. 



3. The metiiod of Claim 2 in which the software emulates the baseband stack and is 
botii instrumented and interpreted/compiled. 
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4. The method of Claim 3, in which the software outputs diagnostic infonnation in 
respect of a component in the same format as the component description for that 
component in order to refine the quality of the component description. 

5. The method of Claim 4 in which the diagnostic information in the component 
description is fed back as an input to the software to improve the accuracy of the modelling. 

6. The method of Claim 2 and any claim dependent on Claim 2 where the software 
outputs computer source code which can be interpreted or compiled to fabricate an actual 
baseband stack implementation. 

7. The method of any preceding claim in which components or modules of the 
baseband stack can be incrementally ported to a target DSP to enable testing and debugging 
of individual ported components or modules. 

8. The method of any preceding Claim in which: 

(a) a first test is carried out using software to emulate a given hardware 
component as part of a design or modelling process; 

(b) the emulated component is replaced with die hardware component, and 

(c) a further test is carried out. 

9. The method of Claim 1 in which the virtual machine layer allows statistical modelling 
in which available resources and interconnect characteristics are represented as statistical 
distdbution functions. 

10. The method of Claim 1 in which the virtual machine layer allows low MIPS code to 
interface widi high MIPS processes by using APIs presented by the virtual machine layer. 



wo 01/53999 



PCT/GBOl/00278 



61 



11. The method of Claim 10 in which the high MIPS processes are implementations of 
abstract processes and are organised in a runtime environment in such a way that access cost 
is optimised. 

12. The method of Claim 1 in which the virtual machine layer comprises a scheduler 
which is programmed to co-schedule processes between different engines in order to give 
optimal resource utQisation during either or bodi of (i) the design and modelling phase and 
(S) a runtime, and in which the resource allocation involves one or both of the following 
steps: (a) measurement using a statistical function; (b) modelling using a statistical 
distribution function. 

13. The method of Claim 12 in which the virtual machine layer supports underlying high 
MIPs algorithms common to a number of different baseband processing algorithms, and 
makes these accessible to high level, architecture neutral, potentially high complexity but 
low-MIPs control flows dirough a scheduler interface, which allows the control flow to 
specify the algorithm to be executed, together with a set of resource constraint envelopes, 
relating to one or more of: time of execution, memory, interconnect bandwidth, inside of 
which the caller desires the execution to take place. 

14. The method of Claim 12 adapted to allow, dviring design or modelling, datapath 
parrioning of high MIPS processes across different engines. 

15. The method of Claim 14 in which the scheduler is awate, during runtime, of the 
datapath pardoning decisions made across different engines. 

16. The method of Claim 10 in which the low MIPS complex code is expressed at least 
in part in a language not designed for real time operations. 



17. The method of Claim 16 in which the language is SDL 
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18. The method of Claim 10 which enables the low MIPS complex code to be 
represented in an architecture neutral manner. 

5 19. The method of Claim 10 which enables a baseband stack to be constructed with 
architecture neutral, low MIPS control codes, in which the control codes use a set of 
architecture neutral APIs specified by the virtual machine layer in order to access 
architecture specific high MIPS processes. 

10 20. The method of Claim 1 9 in which at least one high MIPS engine provides a resource 
for several different kinds of baseband stack. 

21. The method of Claim 10 programmed to characterise the static and dynamic 
resource requirements of different processes so that they can be co-scheduled in real-time 

15 with other processes. 

22. The method of Claim 21 further comprising fiiUy integrated mathematical models, 
statistical simulation tools and a priori pardoning simulation tools. 

20 23. The method of any preceding operating as a design or modelling platform for a 
system on a chip. 

24. The mediod of Claim 23, in which intellectual property blocks, each firom several 
different vendors, can be combined in the system on a chip by virtue of the static and 
25 dynamic resource requirements of each block being modelled by the software so that 
multiple blocks can be co-scheduled together in real-time. 



25. 



The method of Claim 24 in which the blocks perform high MIPS operations. 
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26. The method of Claim 24 in which the blocks perfomi low MIPS, control operations. 

27. The method of Claim 9 as used in a process of migrating the substrate on which 
digital signal processing is performed from (a) a PC prototype for non-real time design and 
modelling to (b) one or more DSP chips with one or more external FPGAs for rantime. 

28. The method of Claim 27 in which the substrate is subsequently migrated to a custom 
ASIC. 



29. The method of Claim 10 in which the virmal machine layer is programmed with or 
enables access to one or more of the following: 

(a) core processes; 

(b) core stmctures; 

(c) core functions: 

(d) flow control: 

(e) state management. 

30. The method of Claim 29 in which the core processes include algorithms to perform 
one or more of the following: source coding, channel coding, modulation, or their inyerses, 
namely source decoding, channel decoding and demodulation. 

31. The method of Claim 29 in which the core structures comprise a symbol processing 
section (concerned witii processing fuU symbols, regardless of whether all die information 
held within that symbol is to be used) and a data directed processing section, in which only 
those bits which hold rdcvmt information are processed. 

32. The method of Claim 31 in which die core stmcture is comprised of processing 
modules operable to aUocate, share and dispose of intermediate, aligned memory buffers, 
and pass events between themselves. 
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33. The method of Claim 29 in which the core functions include one or more of the 
following: resource allocation and scheduling, including memory allocation, real time 
resource allocation and concurrency management. 

5 

34. The method of Claim 29 operable to access PC debug tools. 

35. The method of Claim 29 which is operable with a component, in which only that 
information necessary to enable the software to operate with and/or otherwise model the 

10 performance of die component is supplied by the owner of the intellectual property in the 
component 

36. The method of Claim 29 which is operable with a standardised description of the 
characteristics (including interface and non-interface behaviour) of communications 

15 components to enable a simulator, emulator or modelling tool to accurately estimate the 
resource requirements of a system using those components. 

37. The method of Claim 29 operable to model time, CPU, memory, interconnect 
scheduling and concurrency restraints, enabling mapping onto a real time OS, non real-time 

20 OS, virtual machine or hardware. 

38. A baseband stack developed using the method of any preceding claim. 

39. A communications device using the baseband stack of riaim 38. 

25 

40. A system on a chip developed using the method of any preceding Claim 1-37. 

41. A method of defining a component using a standardised description of the 
characteristics (including interface and/or non-interface behaviour) of that component 
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whereby that standardised description can be used in a method of Claim 1 or constitute the 
component description of Claim 2 and any preceding claim dependent on Claim 2. 

42. A method of defining a baseband stack using a language designed to define some or 
aU of the functionality of the stack to estimate, simulate or fabricate a real stack using the 
method of any preceding Claim 1 - 37. 
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